Natural-emotion GMM transformation algorithm for emotional speaker recognition

نویسندگان

  • Zhenyu Shan
  • Yingchun Yang
  • Ruizhi Ye
چکیده

One of the largest challenges in speaker recognition is dealing with speaker-emotion variability problem. Nowadays, compensation techniques are the main solutions to this problem. In these methods, all kinds of speakers’ emotion speech should be elicited thus it is not user-friendly in the application. Therefore the basic problem is how to get the distribution of speakers’ emotion speech and how to train emotion GMM from their natural speech. This paper presents a natural-emotion GMM transformation algorithm to train users’ emotion model to overcome this problem. The algorithm can convert natural GMM to emotion GMM based on an emotion database. It only needs speakers’ natural speech and needn’t to align the natural utterances with the emotion utterances. The performance evaluation is carried on the MASC database. The promising result is achieved compared to the traditional speaker verification.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic Speech Emotion and Speaker Recognition based on Hybrid GMM and FFBNN

In this paper we present text dependent speaker recognition with an enhancement of detecting the emotion of the speaker prior using the hybrid FFBN and GMM methods. The emotional state of the speaker influences recognition system. Mel-frequency Cepstral Coefficient (MFCC) feature set is used for experimentation. To recognize the emotional state of a speaker Gaussian Mixture Model (GMM) is used ...

متن کامل

Speaker Recognition System Based on the Baseband Correlation Score Reliability Fusion

Emotion mismatch between training and testing will cause system performance decline sharply which is emotional speaker recognition. It is an important idea to solve this problem according to the emotion normalization of test speech. This method proceeds from analysis of the differences between every kind of emotional speech and neutral speech. Besides, it takes the baseband mismatch of emotiona...

متن کامل

Comparison between Gmm-svm Sequence Kernel and Gmm: Application to Speech Emotion Recognition

Speech emotion recognition aims at automatically identifying the emotional or physical state of a human being from his or her voice. The emotional state is an important factor in human communication, because it provides feedback information in many applications. This paper makes a comparison of two standard methods used for speaker recognition and verification: Gaussian Mixture Models (GMM) and...

متن کامل

Emotional Speaker Identification by Humans and Machines

This paper concerns the problem of the effect of emotion change on human and machine for speaker identification. A contrasting experiment is carried out between Automatic Speaker Identification (ASI) system (applying GMM-UBM and Emotional Factor Analysis (EFA) algorithm)and aural system on emotional speech corpus MASC. The experimental result is similar to that in channel-mismatched condition, ...

متن کامل

Emotion attribute projection for speaker recognition on emotional speech

Emotion is one of the important factors that cause the system performance degradation. By analyzing the similarity between channel effect and emotion effect on speaker recognition, an emotion compensation method called emotion attribute projection (EAP) is proposed to alleviate the intraspeaker emotion variability. The use of this method has achieved an equal error rate (EER) reduction of 11.7%...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007